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(2) INFORMATION FOR SEQ ih NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2703 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
CCATGGCCTC GAAAGGGCAG CGGTGATtVt TTTCACATAA ATATATCGCA CTTAAATGAG 
TTTAGACAGC ATGACATCAG AGAGTAATtV AATTGGTTTG GGTTGGAATT CCGTTTCCAA 
TTCCTGAGTT CAGGTTTGTA AAAGATTTtA CTGAGCACCT GCAGGCCTGT GAGTGTGTGT 
GTGTGTGTGT GTGTGTGTGT GTGTGTGTGA VgTATTTTc/cTGGAAAGGA TTCAAAACTA 
GGGGGAAAAA AAAACTGGAG CACACAGGCA gVaTTACGCC ATTCTTCCTT CTTGGAAAAA 
TCCCTCAGCC TTATACAAGC CTCCTTCAAG CcVTCAdTCAGTTGTGCAGG AGAAAGGGGG 
CGGTTGGCTT TCTCCTTTCA AGAACGAGTT ATtW^cT^CTGACTGGA GACGGTGCAC 
GTCTGGATAC GAGAGCATTT CCACTATGGG AO^TAc/ *ACACACACC CGGCAGACTT 480 
CAAGAGTCTC AGACTGAGGA GAAAGCCTTT ZcTtL^TG CTACTGCTGC TGCCGCTGCT 540 
TTTGAAAGTC CACTCCTTTC ATGGTTTTTc/cTG^G€-AGAGGCACCT TTGCTGCTGC 600 
CGCTGTTCTC TTTGGTGTCA TTCAGCGGCT GGcIIgLgA ^GAGACTCCC CAAACTCCTC 660 
ACTTTCTTGC TTTGGTACCT GGCTTGGCTG GACJcTGgL/tCATCTGCAC TGTGTTGGGT 720 

gcccctgact tgggccagag accccagggg acLgAg GATTGGCCAA AGCAGAGGCC 780 

AAGGAGAGGC CCCCCCTGGC CCGGAACGTCJ5CAGGCC«G GGGGTCACAG CTATGGTGGG 840 
GGGGCCACCA ATGCCAATGC CAGGGCAAaI GGAGGCACCG GGCAGACAGG AGGCCTGACA 900 
CAGCCCAAGA AGGATGAACC CAAAAAGCTG^CCCCCAGAcWcGGCCC TGAACCCAAG 
CCAGGACACC CTCCCCAAAC AAGGCAGGCT ACAGCCCGGA CTGTGACCCC AAAAGGACAG 
CTTCCCGGAG GCAAGGCACC CCCAAAAGCA GGATCTGTCC GCAGCTCCTT CCTGCTGAAG 1080
AAGGCCAGGG AGCCCGGGCC CCCACGAGAG CCCAAGGAGC CGTTTCGCCC ACCCCCCATC 1140
ACACCCCACG AGTACATGCT CTCGCTGTAC AGGACGCTGT CuGATGCTGA CAGAAAGGGA 1200
GGCAACAGCA GCGTGAAGTT GGAGGCTGGC CTGGCCAACA CcIaTCACCAG CTTTATTGAC 1260
AAAGGGCAAG ATGACCGAGG TCCCGTGGTC AGGAAGCAGA GgUcGTGTT TGACATTAGT 1320
GCCCTGGAGA AGGATGGGCT GCTGGGGGCC GAGCTGCGGA TCTTGCGGAA GAAGCCCTCG 1380
GACACGGCCA AGCCAGCGGC CCCCGGAGGC GGGCGGGCTG CCCAGCTGAA GCTGTCCAGC 1440
TGCCCCAGCG GCCGGCAGCC GGCCTCCtIg CTGGATGTGC GCTCCGTGCC AGGCCTGGAC
GGATCTGGCT GGGAGGTGTT CGACATCTGjS AAGCTCTTCC GAAACTTTAA GAACTCGGCC 
CAGCTGTGCC TGGAGCTGGA GGCCTGGGaI CGGGGCAGGG CCGTGGACCT CCGTGGCCTG 
GGCTTCGACC GCGCCGCCCG GCAGGTCCAc\ GAGAAGGCCC TGTTCCTGGT GTTTGGCCGC 
ACCAAGAAAC GGGACCTGTT CTTTAATGAg\aTTAAGGCCC GCTCTGGCCA GGACGATAAG 
ACCGTGTATG AGTACCTGTT CAGCCAGCGG LaAAACGGC GGGCCCCACT GGCCACTCGC 
CAGGGCAAGC GACCCAGCAA GAACCTTAAG (WtGCA GTCGGAAGGC ACTGCATGTC 
AACTTCAAGG ACATGGGCTG GGACGACTGG aWtCGC^CCTTGAGTA CGAGGCTTTC 
CACTGCGAGG GGCTGTGCGA GTTCCCATTG CgU^CACc/gGAGCCCAC GAATCATGCA 
GTCATCCAGA CCCTGATGAA CTCCATGGAC CcLgTc/ CACCACCCAC CTGCTGTGTG 
CCCACGCGGC TGAGTCCCAT CAGCATCCTC tJcATT^ACT CTGCCAACAA CGTGGTGTAT 
AAGCAGTATG AGGACATGGT CGTGGAGTCG xi^C^GGTAGCAGCA CTGGCCCTCT 
GTCTTCCTGG GTGGCACATC CCAAGAGCCC CP?3W TCCTGGAATC ACAGAGGGGT 
CAGGAAGCTG TGGCAGGAGC ATCTACACAG cUi^ AGGGGATTCC AATAAGCTTG 
CTCGCTCTCT GAGTGTGACT TGGGCTAAAG JcCX^TTT ATCCACAAGT TCCCCTGGCT 
GAGGATTGCT GCCCGTCTGC TGATGTGACC P ITC&eT&GGC ACAGGTCCAG GGAGACAGAC
TCTGAATGGG ACTGAGTCCC AGGAAACAGT G| :ttt/c^at gagactcagc ccaccatttc
TCCTCACCTG GGCCTTCTCA GCCTCTGGAC tWcTAAGC ACCTCTCAGG AGAGCCACAG
GTGCCACTGC CTCCTCAAAT CACATTTGTG Cc4gTGaVt TCCTGTCCCT GGGACAGTTG
AGAAGCTGAC TGGGCAAGAG TGGGAGAGAA GAGGAGAGG^ CTTGGATAGA GTTGAGGAGT
GTGAGGCTGT TAGACTGTTA GATTTAAATG TATATTGATG AGATAAAAAG CAAAACTGTG
CCT

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 501 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Arg Leu Pro Lys Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp
1               5                   10                  15

Leu Asp Leu Glu Phe Ile Cys Thr Val Leu Gly Ala Pro Asp Leu Gly
            20                  25                  30

Gln Arg Pro Gln Gly Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys
        35                  40                  45

Glu Arg Pro Pro Leu Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser
    50                  55                  60

Tyr Gly Gly Gly Ala Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr
65                  70                  75                  80

Gly Gln Thr Gly Gly Leu Thr Gln Pro Lys Lys Asp Glu Pro Lys Lys
                85                  90                  95

Leu Pro Pro Arg Pro Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro
            100                 105                 110

Gln Thr Arg Gln Ala Thr Ala Arg Thr Val Thr Pro Lys Gly Gln Leu
        115                 120                 125

Pro Gly Gly Lys Ala Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe
    130                 135                 140

Leu Leu Lys Lys Ala Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu
145                 150                 155                 160

Pro Phe Arg Pro Pro Pro Ile Thr Pro His Glu Tyr Met Leu Ser Leu
                165                 170                 175

Tyr Arg Thr Leu Ser Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val
            180                 185                 190

Lys Leu Glu Ala Gly Leu Ala Asn Thr Ile Thr Ser Phe Ile Asp Lys
        195                 200                 205

Gly Gln Asp Asp Arg Gly Pro Val Val Arg Lys Gln Arg Tyr Val Phe
    210                 215                 220

Asp Ile Ser Ala Leu Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg
225                 230                 235                 240

Ile Leu Arg Lys Lys Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly
                245                 250                 255

Gly Gly Arg Ala Ala Gln Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg
            260                 265                 270

Gln Pro Ala Ser Leu Leu Asp Val Arg Ser Val Pro Gly Leu Asp Gly
        275                 280                 285

Ser Gly Trp Glu Val Phe Asp Ile Trp Lys Leu Phe Arg Asn Phe Lys
    290                 295                 300

Asn Ser Ala Gln Leu Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg
305                 310                 315                 320

Ala Val Asp Leu Arg Gly Leu Gly Phe Asp Arg Ala Ala Arg Gln Val
                325                 330                 335

His Glu Lys Ala Leu Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp
            340                 345                 350

Leu Phe Phe Asn Glu Ile Lys Ala Arg Ser Gly Gln Asp Asp Lys Thr
        355                 360                 365

Val Tyr Glu Tyr Leu Phe Ser Gln Arg Arg Lys Arg Arg Ala Pro Leu
    370                 375                 380

Ala Thr Arg Gln Gly Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys
385                 390                 395                 400

Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly Trp Asp Asp
                405                 410                 415

Trp Ile Ile Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu
            420                 425                 430

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val
        435                 440                 445

Ile Gln Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr
    450                 455                 460

Cys Cys Val Pro Thr Arg Leu Ser Pro Ile Ser Ile Leu Phe Ile Asp
465                 470                 475                 480

Ser Ala Asn Asn Val Val Tyr Lys Gln Tyr Glu Asp Met Val Val Glu
                485                 490                 495

Ser Cys Gly Cys Arg
            500

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGAATTCGCA TGCCATGGTC GACG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 






CTTGAGTACG AGGCTTTCCA CTG 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
ATTCGCATGC CATGGTCGAC GAAG
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GGAGCCCACG AATCATGCAG TCA 
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
ACAGCAGGTG GGTGGTGTGG ACT
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGAATTCGCA TGCCATGGTC GACGAAGCTT TTTTTTTTTT TTTT 
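
Each entry declares a LENGTH under (A) and then prints the sequence in space-separated blocks of ten. The following is a minimal Python sketch for checking that the two agree, using the primer lines of SEQ ID NO: 3, 4 and 8 as printed in this listing. The function name `residue_count` and the `entries` table are illustrative only and are not part of the listing.

```python
def residue_count(listing_line: str) -> int:
    """Count residues in a sequence line, ignoring the blank block separators."""
    return sum(1 for ch in listing_line if ch.isalpha())


# Sequence lines copied from the listing, paired with their declared lengths.
entries = {
    "SEQ ID NO: 3": ("AGAATTCGCA TGCCATGGTC GACG", 24),
    "SEQ ID NO: 4": ("CTTGAGTACG AGGCTTTCCA CTG", 23),
    "SEQ ID NO: 8": ("AGAATTCGCA TGCCATGGTC GACGAAGCTT TTTTTTTTTT TTTT", 44),
}

for name, (line, declared) in entries.items():
    counted = residue_count(line)
    status = "ok" if counted == declared else "MISMATCH"
    print(f"{name}: declared {declared}, counted {counted} ({status})")
```

A check of this kind is useful mainly for spotting entries where extraction dropped or garbled residues, since a damaged line will usually no longer match its declared length.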



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
CCAGCAGCCC ATCCTTCTCC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TCCAGGGCAC TAATGTCAAA CA^
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
ACTAATGTCA AACACGTACC TCT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

AGCGGCCGCT 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly Trp Asp
1               5                   10                  15

Asp Trp Ile Ile Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly
            20                  25                  30

Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala
        35                  40                  45

Val Ile Gln Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro
    50                  55                  60

Thr Cys Cys Val Pro Thr Arg Leu Ser Pro Ile Ser Ile Leu Phe Ile
65                  70                  75                  80

Asp Ser Ala Asn Asn Val Val Tyr Lys Gln Tyr Glu Asp Met Val Val
                85                  90                  95

Glu Ser Cys Gly Cys Arg
            100

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 60

(C) OTHER INFORMATION: /note= "Xaa is a gap inserted only for
comparison purposes"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn
1               5                   10                  15

Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly
            20                  25                  30

Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala
        35                  40                  45

Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Xaa Lys Ile Pro Lys
    50                  55                  60

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu
65                  70                  75                  80

Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gln Asp Met Val Val
                85                  90                  95

Glu Gly Cys Gly Cys Arg
            100

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 60

(C) OTHER INFORMATION: /note= "Xaa is a gap inserted only for
comparison purposes"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn
1               5                   10                  15

Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly
            20                  25                  30

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala
        35                  40                  45

Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Xaa Ser Ile Pro Lys
    50                  55                  60

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu
65                  70                  75                  80

Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val
                85                  90                  95

Glu Gly Cys Gly Cys Arg
            100

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln
1               5                   10                  15

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly
            20                  25                  30

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala
        35                  40                  45

Ile Val Gln Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys
    50                  55                  60

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe
65                  70                  75                  80

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val
                85                  90                  95

Arg Ser Cys Gly Cys His
            100

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln
1               5                   10                  15

Asp Trp Ile Ile Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly
            20                  25                  30

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala
        35                  40                  45

Ile Val Gln Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys
    50                  55                  60

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe
65                  70                  75                  80

Asp Asp Asn Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val
                85                  90                  95

Arg Ala Cys Gly Cys His
            100

(2) INFORMATION FOR SEQ ID NO: 18:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln
1               5                   10                  15

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly
            20                  25                  30

Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His Ala
        35                  40                  45

Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr Val Pro Lys
    50                  55                  60

Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val Leu Tyr Phe
65                  70                  75                  80

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val
                85                  90                  95

Arg Ala Cys Gly Cys His
            100



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
ATGAATTCCC ATGGACCTGG GcIgGMAKGA MTGGAT
(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
ACGTGGGGTG GAATGACTGG AT
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATATTGGCTG GAGTGAATGG AT
(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
ATGTGGGCTG GAATGACTGG AT
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
ACCTGGGCTG GCAGGACTGG AT
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
AGGACCTCGG CTGGAAGTGG AT
(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
GGGATCTAGG GTGGAAATGG AT
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
AGGATCTGGG CTGGAAGTGG GT
(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
AGCTGGGCTG GGAACGGTGG AT
(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
ACATCGGCTG GAATGACTGG AT
(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
TCATCGGCTG GAACGACTGG AT
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

ATGAATTCGA GCTGCGTSGG SRCACAGCA 
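
SEQ ID NO: 30 contains the letters S and R, which in the IUPAC nucleotide code stand for "C or G" and "A or G" respectively, so the line describes a small family of primers rather than a single one. The sketch below expands such a degenerate line into its unambiguous members. It is a hedged illustration: the helper name `expand_degenerate` is hypothetical, and nothing in the listing states how the primer mixture was actually prepared.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer: str) -> list[str]:
    """Return every unambiguous sequence encoded by a degenerate primer line."""
    bases = primer.replace(" ", "").upper()
    return ["".join(combo) for combo in product(*(IUPAC[b] for b in bases))]


# Sequence line of SEQ ID NO: 30 as printed in the listing.
variants = expand_degenerate("ATGAATTCGA GCTGCGTSGG SRCACAGCA")
print(len(variants))  # three two-fold positions: 2 * 2 * 2 = 8
```

One of the eight expansions ends in GAGCTGCGTGGGCGCACAGCA, the 21-mer printed as SEQ ID NO: 34. Whether that relationship is intended is not stated in this section.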

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
GAGTTCTGTC GGGACACAGC A
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
CATCTTTTCT GGTACACAGC A
(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
CAGTTCAGTG GGCACACAAC A
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GAGCTGCGTG GGCGCACAGC A
(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
CAGCGCCTGC GGCACGCAGC A
(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
TAAATCTTGG GACACGCAGC A
(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
CAGGTCCTGG GGCACGCAGC A
(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
CCCTGGGAGA GCAGCACAGC A
(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CAGCTTGGTG GGCACACAGC A
(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
CAGCTTGGTG GGAATGCAGC A



SEQ ID NO: 1

TYPE OF SEQUENCE: nucleic acid sequence
NAME AND ORIGIN: MP-52 DNA
LENGTH: 2703 nucleotides



GRG33 CAAGaS^CCCAGCAAGAACCITAAGG^ 

CTQGftftGCTOTOXaCSGAGCftTC^^ 
CiaSCirHCIGAGTOIGACITOG^^ 

ocrr 



SEQ ID NO: 2

TYPE OF SEQUENCE: amino acid sequence
NAME AND ORIGIN: MP-52 protein
LENGTH: 501 amino acids

MRLPKLLTFL LWYLAWLDLE FICTVLGAPD LGQRPQGTRP GLAKAEAKER
PPLARNVFRP GGHSYGGGAT NANARAKGGT GQTGGLTQPK KDEPKKLPPR
PGGPEPKPGH PPQTRQATAR TVTPKGQLPG GKAPPKAGSV PSSFLLKKAR
EPGPPREPKE PFRPPPITPH EYMLSLYRTL SDADRKGGNS SVKLEAGLAN
TITSFIDKGQ DDRGPVVRKQ RYVFDISALE KDGLLGAELR ILRKKPSDTA
KPAAPGGGRA AQLKLSSCPS GRQPASLLDV RSVPGLDGSG WEVFDIWKLF
RNFKNSAQLC LELEAWERGR AVDLRGLGFD RAARQVHEKA LFLVFGRTKK
RDLFFNEIKA RSGQDDKTVY EYLFSQRRKR RAPLATRQGK RPSKNLKARC
SRKALHVNFK DMGWDDWIIA PLEYEAFHCE GLCEFPLRSH LEPTNHAVIQ
TLMNSMDPES TPPTCCVPTR LSPISILFID SANNVVYKQY EDMVVESCGC R
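
The protein appears in this document in two notations: three-letter codes in the numbered entry for SEQ ID NO: 2 and one-letter codes in the block above. A minimal Python sketch of the conversion between them follows, which can help when cross-checking the two listings against each other. The table is the standard amino acid code; the function name `to_one_letter` is illustrative and does not come from the document.

```python
# Standard three-letter to one-letter amino acid codes, plus Xaa for an
# unspecified position (used in this listing to mark an alignment gap).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[code] for code in three_letter.split())


# First fifteen residues of SEQ ID NO: 2 as given in the three-letter entry.
start = "Met Arg Leu Pro Lys Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala"
print(to_one_letter(start))  # MRLPKLLTFLLWYLA
```

A lookup that raises `KeyError` on unknown codes is a deliberate choice here: extraction errors such as "Gin" for "Gln" or "He" for "Ile" then fail loudly instead of being silently skipped.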



